How to Use SAS for GMM Logistic Regression Models for Longitudinal Data with Time-Dependent Covariates

نویسندگان

  • Katherine Cai
  • Jeffrey Wilson
چکیده

In longitudinal data, it is important to account for the correlation due to repeated measures and timedependent covariates. Generalized method of moments can be used to estimate the coefficients in longitudinal data, although there are currently limited procedures in SAS ® to produce GMM estimates for correlated data. In a recent paper, Lalonde, Wilson, and Yin provided a GMM model for estimating the coefficients in this type of data. SAS PROC IML was used to generate equations that needed to be solved to determine which estimating equations to use. In addition, this study extended classifications of moment conditions to include a type IV covariate. Two data sets were evaluated using this method, including rehospitalization rates from a Medicare database as well as body mass index and future morbidity rates among Filipino children. Both examples contain binary responses, repeated measures, and timedependent covariates. However, while this technique is useful, it is tedious and can also be complicated when determining the matrices necessary to obtain the estimating equations. We provide a concise and user-friendly macro to fit GMM logistic regression models with extended classifications. INTRODUCTION Longitudinal studies focus on a particular outcome measured across time. While studies investigating changes over time are very useful, multiple measurements collected on a single subject can result in correlated observations. The correlation can affect the standard errors in the estimation processes as the variation within a particular subject is likely to be much smaller than the variation between subjects. Moreover, time-dependent covariates present some additional challenges in working with longitudinal modeling. In particular, some predictors can change over time due to feedback from the response, and need to be accounted for in the modeling process. In turn, the change in predictors can impact the response. SAS currently offers procedures, which utilize the statistical methods of generalized estimating equations (GEE) and generalized linear mixed models (GLMM), to analyze longitudinal data with binary outcomes. A macro that performs generalized method of moments (GMM) logistic regression is presented, which can appropriately take into account the correlation between covariate values. The use of the GMM macro is illustrated and compared to the results to SAS PROC GENMOD and PROC GLIMMIX. The three methods are demonstrated through the analysis of a body mass index (BMI) and morbidity dataset collected in the Philippines. THE DATA The data were collected by the International Food Policy Research Institute in the Bukidnon Province in the Philippines. BMI and morbidity were measured for 370 children at three separate time points, separated by 4-month intervals. The purpose of this study was to predict morbidity for children over time based on various factors. The dataset contains a total of 1,110 observations, with three different BMI measurements for each of the 370 children. For each of the children (labeled by childID), the visit number (time) and body mass index (BMI) were recorded. For each of the three visits, it was recorded whether the child was sick (sick = 1) or healthy (sick = 0) at the time of measurement. Although additional information was collected in the study, we wish to predict morbidity based on the visit number and the child’s BMI. The SAS data set was created using the following code: data Morbidity; input childID BMI time sick; datalines; 206 14.95059 1 0

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SAS Macro for Generalized Method of Moments Estimation for Longitudinal Data with Time-Dependent Covariates

Longitudinal data with time-dependent covariates is not readily analyzed as there are inherent, complex correlations due to the repeated measurements on the sampling unit and the feedback process between the covariates in one time period and the response in another. A generalized method of moments (GMM) logistic regression model (Lalonde, Wilson, and Yin 2014) is one method to analyze such corr...

متن کامل

کاربرد مدل توأم بقا و داده های طولی در بیماران دیالیز صفاقی

Background and Aim: In many medical studies along with longitudinal data, which are repeatedly measured during a certain time period, survival data are also recorded. In these situations, using models such as, mixed effects models or GEE method for longitudinal data and Cox model for survival data, are not appropriate because some necessary assumptions are not met. Instead, the joint models hav...

متن کامل

184-31: Fixed Effects Regression Methods in SAS®

Fixed effects regression methods are used to analyze longitudinal data with repeated measures on both independent and dependent variables. They have the attractive feature of controlling for all stable characteristics of the individuals, whether measured or not. This is accomplished by using only within-individual variation to estimate the regression coefficients. This paper surveys the wide va...

متن کامل

Prediction of mental disorders after Mild Traumatic Brain Injury: principle component Approach

Introduction: In Processes Modeling, when there is relatively a high correlation between covariates, multicollinearity is created, and it leads to reduction in model's efficiency. In this study, by using principle component analysis, modification of the effect of multicolinearity in Artificial Neural Network (ANN) and Logistic Regression (LR) has been studied. Also, the effect of multicolineari...

متن کامل

بررسی عوامل موثر بر زمان بقاء بیماران لوسمی حاد بعد از پیوند مغز استخوان با استفاده از مدل‌های چندحالتی نیمه مارکفی در بیمارستان شریعتی تهران

Abstract Background: Semi-Markov multi-state models are very important to describe regression and progression in chronic diseases and cancers. Purpose of this research was to determine the prognostic factors for survival after acute leukemia using multi-state models. Materials and Methods: In this descriptive longitudinal research, a total of 507 acute leukemia patients (206 acute lymphocyt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015